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BACKGROUND OF THE INVENTION 

Technical Field 

The present invention relates to the field of speech recognition, and more 
particularly, to speech-based user interfaces. 

Description of the Related Art 

Speech interfaces are frequently used in conjunction with database-driven 
systems to provide users with a speech-based method of searching for information. 
One common example of such a system Is a telephone directory system where a user 
can verbally specify an argument, such as a name, for which the speech-enabled 
system can search. Speech interfaces can work effectively in cases where a database 
search returns a single search result. If the search returns multiple search results, 
however, the search effectively fails unless the user is provided with an opportunity to 
select, or disambiguate, one search result from the multiple search results. 

Disambiguation within a visual environment can be accomplished with relative 
ease in comparison with disambiguation in an audio environment. In a visual 
environment, search results are displayed to the user so that the user then can make a 
selection. Disambiguation within a speech interface, however, can be problematic. 
Specifically, each search result is played to the user, data field by data field. Playing 
search results in this manner can result in a confusing and nonsensical playback of the 
search results. The search results effectively serve as long and confusing speech 
menu items which can be difficult for the user to remember when making a selection. 
Moreover, some data items can be unpronounceable by the speech interface. 

One solution has been to play only search result contents from predetermined 
data fields in an effort to reduce speech menu item size. If, however, the selected data 
field includes duplicate data items among the search results, the search results cannot 
be disambiguated by the predetermined data field. In that case, the user hears what 
can sound like duplicate speech menu items, despite the fact that each speech menu 
item conresponds to a different search result. 
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SUMMARY OF THE INVENTION 

The invention disclosed herein provides a method and apparatus for 
disambiguating multiple database search results. More specifically, the invention can 
analyze database search results to determine a data field suitable for uniquely 
identifying each search result when presented through a speech interface. Accordingly, 
users can select a desired search result without having to view a listing of the search 
results on a display. 

One aspect of the present invention can include a method of disambiguating 
database search results. The method can include retrieving multiple database entries 
responsive to a database search. The retrieved database entries can include a plurality 
of common data fields. The retrieved database entries can be processed according to 
predetermined speech interface criteria. For example, data fields of the retrieved 
database entries which have common data items can be excluded from further 
processing, data fields of the retrieved database entries having pronounceable data 
items can be identified, and a data field from the plurality of common data fields having 
data items with a smallest average length can be determined. Additionally, a data field 
from the plurality of common data fields having data items which do not exceed a 
predetermined maximum threshold can be determined. Regardless, at least one data 
field from the plurality of common data fields can be selected for uniquely identifying 
each of the retrieved database entries. The data items corresponding to the selected 
data field for each retrieved database entry then can be presented through a speech 
interface. 

Another aspect of the present invention can include a method of disambiguating 
database search results wherein multiple database entries can be retrieved responsive 
to a database search. The retrieved database entries can include a plurality of 
common data fields. The retrieved database entries can be processed according to 
predetermined speech interface criteria; and, at least one of the data fields from the 
plurality of common data fields can be selected which can uniquely identify each 
retrieved database entry. A query can be issued, for example to a user, to determine 
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which one of the common data fields, which can uniquely identify each of the retrieved 
database entries, is to be used to disambiguate the retrieved database entries. 

The method further can include receiving a user input selecting one of the 
common fields which can uniquely identify each of the retrieved database entries. 
Another user input specifying a data item associated with the selected data field can be 
received to disambiguate the retrieved database entries. Alternatively, data items 
associated with the selected data field automatically can be presented through a 
speech interface for each retrieved database entry. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

There are shown in the drawings embodiments which are presently preferred, It 
being understood, however, that the invention is not so limited to the precise 
arrangements and instrumentalities shown. 

Figure 1 is an excerpt from an exemplary database after having performed a 
database search. 

Figure 2 is a flow chart illustrating a method of disambiguating multiple database 
entries In accordance with the inventive arrangements disclosed herein. 
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DETAILED DESCRIPTION OF THE INVENTION 

The invention disclosed lierein provides a metliod and apparatus of 
disambiguating multiple database search results. More specifically, the present 
invention facilitates the presentation of multiple database search results through a 
speech Interface such as a text-to-speech system. A suitable data field from the search 
results, which can be used to distinguish or disambiguate one search result from 
another, automatically can be determined. Through an analysis of the data fields and 
the contents of the data fields, one of the data fields of the search results can be 
selected as a disambiguation data field. The contents of the disambiguation data field 
then can be presented to a user through the speech interface so that the user can 
select the desired database search result. Accordingly, users can select a desired 
search result without having to view a listing of the search results on a display. 

Those skilled in the art will recognize that the present invention can be used with 
any of a variety of speech-enabled systems which incorporate database and database 
search functions. Accordingly, although a speech-enabled directory assistance 
program has been used for purposes of illustration, the present invention is not so 
limited to the particular examples and embodiments as disclosed herein. 

Figure 1 illustrates a series of database entries identified responsive to a user 
search. A header row (top row) listing the data field names of the database and an 
analysis row (bottom row) have been included for purposes of illustration. Responsive 
to a database search for the name "Joe Smith" in the "Name" data field, the search has 
retrieved eight results. Each search result includes the name "Joe Smith" within the 
"Name" field of the database entry. As shown, the database entries include additional 
data fields such as "Formal Name", "Phone", "Location", "Job Description", "Dept. 
Number", and "Dept. Name". The present invention can process the search results, in 
accordance with predetermined criteria, to select a disambiguation field which includes 
data that not only can be properly played through a speech interface, but also can 
uniquely identify each of the entries. 

In selecting a disambiguation data field, the search results can be processed to 
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determine whether any of the data fields include duplicate data items. Data fields 
containing duplicate data items cannot uniquely identify each of the search results. 
Accordingly, these data fields can be failed as disambiguation data fields and excluded 
from further processing. For example, the "Name" and "Formal Name" data fields 
include duplicate items. These data fields can be excluded from further processing 
because neither field can uniquely identify each of the search result entries 1-8. 
Specifically, entries 1-8 each include the identical data item "Joe Smith" within the 
"Name" field. Entries 3, 7, and 8 include the identical data item "Joseph R. Smith" 
within the "Forma! Name" data field. Notably, the "Formal Name" data field also 
includes other duplicate data items such as "Joe Smith". 

The search results further can be processed to determine whether the data items 
within the data fields accurately can be pronounced through a speech interface. Those 
skilled in the art will recognize that this determination can be made using any of a 
variety of techniques such as using a dictionary to lookup data items or analyzing the 
patterns of vowels and consonants of the data items. As shown in Figure 1 , the data 
items within the "Phone" and "Dept. Number" data fields have been failed as possible 
disambiguation data fields because a determination has been made that one or more of 
the data items cannot be pronounced by the speech interface. In this case, these data 
fields include alphanumeric combinations rather than text words. Still, data fields can 
include text words which the speech interface is unable to pronounce. Such is the 
case, for example, if the text words are not included within a dictionary or include 
consonant-vowel combinations which are not specified in the speech interface. In any 
event, data fields Including data items which cannot be pronounced can be excluded or 
failed as possible disambiguation fields. 

The search results also can be processed to determine whether the lengths of 
the data items within the data item fields exceed a predetermined maximum length. 
Lengths of data items can be specified using any appropriate unit of measure such as 
characters, syllables, or words. The maximum length can be determined from an 
empirical analysis of the relative ease with which users recall and pronounce speech 
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menu items of various lengtfis. If the data items for a particular data field exceed the 
maximum length, the data field can be excluded or failed as a disambiguation field. 
The length detemiination can be performed on a per data Item basis such that if any 
one of the data Items exceeds the maximum length, the data field can be excluded or 
failed as a disambiguation field. Alternatively, different statistical measures of the data 
item lengths can be calculated. Accordingly, as shown, the "Dept. Name" data field has 
been excluded because the data items are too long. 

If the data items do not exceed the maximum length, the processing can 
continue. As shown, an average character length and syllable length from the data 
items in the "Location" data field and the "Job Description" data field have been 
detemiined. From these two data fields, the data field having the data items of the 
shortest average length can be selected as the disambiguation data field. Accordingly, 
the data items of the disambiguation data field can be presented through the speech 
interface as selections. For example, the directory assistance system can query the 
user as follows: "Found eight matches for this name, Please choose a location: United 
Kingdom, Las Vegas, West Palm Beach, Hursley, Chicago, Poughkeepsie, Austin, or 
Tucson". 

Figure 2 is a flow chart illustrating a method 200 of disambiguating multiple 
database entries. The method 200 can begin in step 205 where a user can initiate a 
database search for a particular data item. For example, a user can initiate a search for 
the name of a person such as "Joe Smith", a telephone number, or other data item for 
which the underlying database can be searched. 

In step 210, if the database search does not retrieve any results, the search was 
unsuccessful. Accordingly, the method can continue to step 240 and end. If the 
database search retrieves a single record, then the search is successful. In that case, 
the method can continue to step 235 where a specified action can be taken such as 
providing the telephone number through a speech interface or connecting the user to 
the telephone number specified in the database search result. 

If, however, the database search reveals more than one search result, the 
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method can continue to step 215. In step 215, the search result data fields and 
contents, or data items, of the data fields can be processed. As previously mentioned, 
the data fields can be analyzed to detemnine data field contents which can be played 
through a speech interface and can uniquely identify each of the database search 
results. Accordingly, the processing can include determining whether data items of a 
data field can be pronounced by the speech interface, whether a data field uniquely 
identifies each search result (i.e. does not contain duplicate data items), as well as 
determining the individual lengths of data items and the average length of the data 
items within a particular data field. Data fields that comply with the aforementioned 
requirements can be identified and data fields that do not comply can be excluded from 
further processing. After completion of step 215, the method can continue to step 220. 

In step 220, a data field whose contents can be played through a speech 
interface and which can be used to uniquely identify one of the entries from the search 
results can be selected automatically. For example, referring to Figure 1, the "Location" 
data field can be selected as the disambiguation data field. Notably, the "Location" 
data field also includes data items having the smallest average length. In step 225, the 
search results can be presented to the user. In particular, each data item 
corresponding to the disambiguation data field of the search results can be played 
through the speech interface. As mentioned, a directory assistance system can query 
the user as follows: "Found eight matches for this name, Please choose a location: 
United Kingdom, Las Vegas, West Palm Beach, Hursley, Chicago, Poughkeepsie, 
Austin, or Tucson". 

In an alternative embodiment of the invention, in the event that several 
disambiguation data fields are determined, the user can be queried as to which data 
field should be used. For example, referring to Figure 1 , the user can be notified that 
both the "Location" and the "Job Description" data fields can uniquely identify any one 
of the retrieved search results. Accordingly, the user can be queried to specify a 
particular data field and data item associated with that data field. Thus, if the user 
knows Joe Smith's job description or the location in which Joe Smith works, the user 
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can specify the appropriate data field. The user also can provide the appropriate 
location or job description for Joe Smith as the case may be. For example, the user 

can state "location Chicago" or "Job Description . . . Programmer". In another 

aspect of the invention, once the user specifies a preferred data field, the data items 
associated with the data field can be provided to the user automatically. In any event, 
after completion of step 225, the method can continue to step 230. 

In step 230, the user can select one of the presented choices. For example, the 
user can say "Chicago", press a number conresponding to the selection, or say a 
nunrrber corresponding to the selection. If the user selects one of the presented 
choices, the search is successful and the method can continue to step 235. In step 
235, any of a variety of specified actions can be taken such as providing the user with 
particular data items through the speech interface or connecting the user to the 
telephone number specified in the database entry. If, however, the user does not select 
one of the choices presented in step 230, the search is unsuccessful and the method 
can continue to step 240 and end. 

The present invention can be realized in hardware, software, or a combination of 
hardware and software. The present invention can be realized in a centralized fashion 
in one computer system, or in a distributed fashion where different elements are spread 
across several interconnected computer systems. Any kind of computer system or 
other apparatus adapted for carrying out the methods described herein is suited. A 
typical combination of hardware and software can be a general purpose computer 
system with a computer program that, when being loaded and executed, controls the 
computer system such that it carries out the methods described herein. 

The present invention also can be embedded in a computer program product, 
which comprises all the features enabling the implementation of the methods described 
herein, and which when loaded in a computer system is able to carry out these 
methods. Computer program in the present context means any expression, in any 
language, code or notation, of a set of instructions intended to cause a system having 
an infomiation processing capability to perform a particular function either directly or 
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after either or both of the following: a) conversion to another language, code or 
notation; b) reproduction in a different material forni. 

This invention can be embodied In other forms without departing from the spirit 
or essential attributes thereof. Accordingly, reference should be made to the following 
claims, rather than to the foregoing specification, as indicating the scope of the 
invention. 
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